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Abstract 



Consider a source £ of pure quantum states with von Neumann entropy S. By 
the quantum source coding theorem, arbitrarily long strings of signals may be encoded 
asymptotically into S qubits/signal (the Schumacher limit) in such a way that entire 
^C^' \ strings may be recovered with arbitrarily high fidelity. Suppose that classical storage is 

free while quantum storage is expensive and suppose that the states of £ do not fall into 
OA ' two or more orthogonal subspaces. We show that if £ can be compressed with arbitrarily 

^ . high fidelity into A qubits/signal plus any amount of auxiliary classical storage then 

' A must still be at least as large as the Schumacher limit S oi £. Thus no part of the 

. quantum information content of £ can be faithfully replaced by classical information. 

^-H ' If the states do fall into orthogonal subspaces then A may be less than S, but only by 

T-H . an amount not exceeding the amount of classical information specifying the subspace 

' for a signal from the source. 

o ■ 

1 Introduction 

C ■ 

d \ The quantum source coding theorem ^|S1 12111113 provides one of the clearest manifestations 

of the concept of quantum information. It characterises the minimal resource (in terms of 
• • , Hilbert space dimension) that is sufficient to faithfully represent long sequences of signal 

\ emissions from a memoryless quantum source. This provides a notion of the quantum 

^ ' information content of the source and the minimal resource is given by the Schumacher 

. Umit - S qubits/signal - where S is the von Neumann entropy of the source. In this paper 

we consider a possible refinement of this theorem, asking to what extent the quantum 
information may be represented in two parts - a classical part and a quantum part - such 
that the quantum part is minimised while the classical part may be as large as desired. We 
will show that it is impossible to reduce the resource of the quantum part to below the 
Schumacher limit, except in the special case that the signal states fall into two or more 
orthogonal subspaces. Thus in general (i.e. with the preceding exception) it is impossible 
to substitute classical information for any part of the quantum information of a source. 

The paper is organised as follows. The main results are given in theorems ^ and |21 of 
section HI We approach the proofs of these results through a sequence of lemmas after 
establishing some preliminary definitions and terminology. In section 21 we provide a formal 
definition of a coding-decoding scheme which applies to blocks of signals of general length 
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n. We define the fidelity of any sucli sclieme and state Schumaclier's quantum source coding 
theorem. In section|21we introduce the distinction between reducible and irreducible sources 
i.e. sources whose signal states respectively do or do not fall into orthogonal subspaces. This 
distinction is fundamental for our main results and we give an alternative characterisation 
of it which is used in our subsequent proofs. 

In section |1] we refine the concept of coding-decoding schemes to a situation in which 
the encoding has a classical part and a quantum part. In terms of this concept we briefly 
review earlier work of jH] which provided the motivation of our present study, and we give 
a precise statement of our main new results. For any such refined coding-decoding scheme 
the classical part of the encoding may be assumed to remain intact after the input signal 
blocks have been reconstructed (with some fidelity) by decoding. Correspondingly in section 
El we begin the proof of our main results by considering the classical mutual information Z 
between the identity of the input string and the classical part of the encoding. If the input 
string has length n and Q denotes the number of qubits needed to support the quantum part 
of the encoding, then we prove that (Q + X) /n cannot remain less than the Schumacher 
limit as the fidelity of the coding-decoding scheme approaches unity. Finally to provide 
a lower bound for Q/n, in section El we study the behaviour of I/n for irreducible and 
reducible sources. We prove that I/n must tend to zero for any irreducible source as the 
fidelity of the coding-decoding scheme tends to unity. For reducible sources the situation is 
more complicated: clearly it is possible to at least determine the identity of the orthogonal 
subspace to which a given signal belongs, without disturbing the signal. We prove that this 
is the best we can do i.e. that I/n cannot exceed the amount of classical information about 
the signal provided by this identification. 

These lemmas in section (HI are mathematically precise examples of a heuristic principle 
in quantum information theory viz. that it is impossible to obtain information about the 
identity of a quantum state from an irreducible source without irreparably disturbing the 
state and furthermore, that there should be a trade-off between the amount of disturbance 
and the amount of information gained. Such information-disturbance results have been 
derived in other situations but for us there are extra technical complications arising 
from the fact that the block length n must generally increase unboundedly as the fidelity 
of the coding-decoding scheme tends to unity i.e. we have a situation in which the source 
varies as the disturbance tends to zero. Our lemmas in section El referring to a situation 
of unboundedly increasing block lengths, may have a wider applicability for example to the 
study of the security of quantum cryptographic protocols, in which an eavesdropper may 
attempt to extract classical information from blocks of signal transmissions. 

In section |71 we draw together the lemmas of the preceding sections to give proofs of our 
main results. Finally in section |S1 we summarise our findings and discuss some related open 
questions. 
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2 Preliminary definitions 



We begin with a more precise statement of the quantum source coding theorem which will 
also serve to establish terminology and notations for our main results. We sometimes denote 
the ensemble or source of (generally mixed) states with prior probabilities pi as {^i;Pi}- 
Consider a source S = {\ai) ;pi} of pure quantum signal states \ai) with prior probabilities 
Pi. We will use capital letter indices to denote multi- indices for blocks of signals of length 
n: 

Wi) = kii) . . . (1) 

PI = Pi,... pi„ (2) 
I = ii...in (3) 

We will often write the projector \aj){aj\ simply as aj. Let Ti. denote the Hilbert space of 
single signals, of dimension k, and let Ba denote the space of all mixed states of a qubits 
(or the smallest integer greater than a if a is not an integer) . Then n-strings aj are in 7^ 
and in Bniogk- In this paper, logarithms are always to base 2. 

If and p are any pure and mixed state respectively in the same state space, we define 
the fidelity F by 

F{\iP){^P\,p) = {^P\p\^P) (4) 
More generally if uj and p are mixed states we define the fidelity by |13| I14j 



F(p,u;) = (^tr ^V^pV^j (5) 

The von Neumann entropy 5 of iS is defined by 

S = — tr plog p (6) 

where p = J2iPi is the overall density matrix of the signal states. 

An encoding-decoding scheme for blocks of length n, to a qubits/signal and average 
fidelity 1 — e, is defined by the following ingredients: 

(i) an encoding operation En : "H®^ — > Bna which is a completely positive trace preserving 
map (a CPTP map).^ En{(Ji) is a (mixed) state of na qubits called the encoded or com- 
pressed version of aj. 

(ii) a decoding operation D„ ■ Bna Bniogk which is also a CPTP map. We write 
CTj = DnEn{crj) and call it the decoded version of aj. Note that ai is generally a mixed 
state. 

(iii) the average fidelity between the cr/'s and a/'s is 1 — e: 

Y,PiF{ai,ai) = l-e (7) 



^ These encoding operations are called blind, in contrast to visible encodings in which En is allowed to 
be an arbitrary map, but D„ in (ii) is still required to be CPTP. See W for a further discussion of this 
distinction. Note that the visible situation is trivial for our main problem: all of the information about the 
input may be faithfully extracted in classical form by recording the identity of the input labels. 
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We say that the source E may be compressed to a qubits / signal if the following condition 
is satisfied: for all e > there is an no such that for all block lengths n > tiq there is an 
encoding-decoding scheme for blocks of length n to a qubits / signal and average fidelity at 
least 1 — e. We can now make the following precise statement. 

Quantum source coding theorem. [H El il Let 5 be the von Neumann entropy 
of a source £ of pure quantum states and suppose that a ^ S. Then £ may be compressed 
to a qubits/signal if and only if a > 5. ■ 

3 Reducible and irreducible sources 

For our main results it will be important to classify sources according to whether or not 
they decompose into orthogonal parts in the following sense: 

Definition 1 A source £ is called reducible if its signal states fall into two or more orthog- 
onal subspaces. Otherwise £ is called irreducible. 

If £i = {aij]pij}j for i = 1, . . . ,L are sources of signals lying in mutually orthogonal 
subspaces then we may construct the reducible source £ = \Jiai£i = {aij;aiPij}ij where 
{oi, . . . , ol} is any chosen probability distribution (and the subscript outside the bracket is 
the index labelling the signal states). Conversely any reducible source £ may be decomposed 
into irreducible parts £ = [j ai£i by choosing a maximal orthogonal decomposition. Here 
ai is the total probability of all states of £ lying in the i^^ orthogonal subspace and £i 
comprises these states with suitably renormalised probabilities. 

We give an alternative characterisation of irreducibility of a source of pure states which 
will be used in our later proofs. 

Definition 2 If\(y) and \t) are any signal states, a chain from \ct) to |r) of length m is a 
sequence of signal states \cTi) beginning with \a) and ending with \t): 

\a) = |cri) , |(T2) . . . , |<Tm) = \t) 

such that (iTj|(Tj-|-i) ^ for all i = 1, . . . , m — 1. 
Lemma 1 Let £ be an ensemble with K signal states. 

(a) £ is irreducible if and only if for any two signal states \a) and |r) there is a chain from 
\a) to \t). 

(b) If there is a chain from \a) to \t) then there is a chain of length at most K. 

Proof We will prove the contrapositive form of (a). Thus suppose that the signal states 
do fall into two orthogonal subspaces Ei and E2. Let \a) G Ei and |r) G E2. Then any 
chain from \a) to |r) would have a jump from Ei to E2 at some stage. But this is impossible 
so there can be no chain from \a) to |t). Conversely suppose that there is no chain from 
\a) to |t). Let M^- be the set of all signal states that are reachable from \a) by chains. Let 
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be the complement. Thus \a) € Mq- and |t) G M^, so both sets are non-empty. Now 
any |t') E is orthogonal to all signals in Mo- (since if (c'|r') 7^ for some |cr') G Mg- we 
would have a chain from |cj) to \a') that extends to |r'), which is impossible). Let Ei and 
E2 be the linear span of signals in Mq- and respectively. Then Ei and E2 are orthogonal 
subspaces containing all the signal states i.e. £ is reducible, 
(b) Suppose that a chain from \a) to |r) contains some signal \a') twice: 

\a) ,...,\a') ,...,\a') ,...\t) . 

Then we may delete the section between the two |o"')'s and still have a chain. Hence if there 
is a chain there is also a chain that contains each signal at most once i.e. having length at 
most K. ■ 

Example £ may contain orthogonal states yet still be irreducible. Minimal chains may 
need to have maximal length K. Consider for example £ with K = 5 states given by 
|0) , |0) + |1) , |1) + |2) , |2) + |3) , |3). Then £ is irreducible. |0) is orthogonal to |3) and the 
shortest chain between them has 5 members. ■ 

4 Coding with a classical and quantum part 

In the context of quantum information theory, classical information may be thought of as a 
special case viz the quantum information of a source of states that are known (or required) 
to always be members of a prescribed orthonormal basis. More generally we may consider a 
quantum register as holding only classical information (relative to a prescribed orthonormal 
basis) if there is an omni-present fully decohering operation acting on the register, diagonal 
in the basis, which prevents the occurrence of any non-trivial superpositions of the basis 
states or any entanglements of this register with any other quantum registers being consid- 
ered. Thus the most general allowable ("classical") state of the register is a probabilistic 
mixture of the basis states, which may be classically correlated to the quantum state of 
all other registers (cf eq. © below). These conditions endow classical information with 
special properties not shared by quantum information in general. For example classical 
information is robust compared to quantum information - it may be readily stabilised and 
corrected by frequent measurement in the given basis, which would destroy genuine quan- 
tum information. Also, unlike quantum information, it may be cloned or copied. These 
and other singular properties indicate that for many purposes it is useful to regard classical 
information as a separate resource, distinct from quantum information. In this vein, it is 
natural to ask if the quantum source coding theorem may be refined along the lines outlined 
in the opening paragraph of section ^ (and formulated precisely below). 

Since our compression schemes are required to operate with arbitrarily high fidelity 
(i.e. reproduce the source states arbitrarily well as e — > 0) the question of whether part 
of the quantum information of the source may be represented in classical terms, may be 
alternatively phrased as the question of whether it is possible to reversibly extract classical 
information from a quantum source in such a way that the residual quantum information 
content is reduced. This question has already been raised in [Hj and in [3] (at the end of 
chapter 1). 
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Consider an encoding operation En which encodes \ai) into two registers A and B 
where A holds the classical part and B holds the quantum part of the encoding. Let {|j)} 
be the classical orthonormal basis of A. The most general allowable classical state in A is 
a probability distribution over j values so the most general form of the encoded state may 
be written 

En\aj)=Y,c^\j){j\^u;j (8) 
j 

where are some (generally mixed) states of the subsystem B. Here = p{j\I) is the 
probability of having j in A given that we are encoding the I^^ input string. An encoding 
operation of this type can be physically interpreted as the action of an (incomplete) quantum 
measurement on \cri). In this case j is the measurement outcome and uJj is the post 
measurement state (after possible further processing in a way that can depend on the value 

ofi). 

^From Pi and pj = J2i Pij\^)Pi have p{I\j) = p{j\I)pi /Pj and for each fixed value of 
j we get the ensemble 

8j = WMl\j)]. (9) 

Let suppj be the least number of qubits/signal required to support the states in 8j. Then 
the quantum resource of the encoding is defined to be 

supp = suppj (10) 

i 

We will say that a source £ may be compressed to a qubits/signal plus auxiliary classical 
storage if for all e > there is an no such that for all n > no we have an encoding-decoding 
scheme {En,Dn) with fidelity 1 — e and supp = a. 

In terms of the above notions the main result of 'H| may be stated as follows. 

Proposition 1 (^81) Let £ be any irreducible source of pure states. Let S be the von 
Neumann entropy of £ and let En be any encoding scheme for blocks of length n from E 
having a classical and quantum part as in eq. ^ above. Suppose further that 

(a) the states Uj in the encoding are all pure states, 

(b) the coding scheme works with fidelity 1 i.e. the states \ai) may be perfectly reconstituted 
from their encoded versions. 

Then the von Neumann entropy of each ensemble £j is nS. 

Hence under the assumptions (a) and (b) it is impossible to reduce the quantum resource 
of the encoding below the Schumacher limit S qubits/signal of the original source, by any 
procedure that extracts classical information, since S is also the Schumacher limit per input 
signal for each 8j. 

The restrictions (a) and (b) are in fact very severe. In particular a requirement of 
perfect fidelity (as in (b)) would rule out many basic theorems of information theory. The 
compression of classical information given by Shannon's source coding theorem and the 
compression given by the quantum source coding theorem, for example, would both be 
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impossible. Thus it is of great interest to require only the weaker condition of asymptotically 
perfect fidelity i.e. fidelity of 1 — e for all e > where decreasing e will generally involve 
working with increased block lengths n. This question was raised in ^ but left open. 

We now consider the most general situation where both restrictions (a) and (b) are 
lifted. Our main results are given in theorems ^ and [21 below. 

Theorem 1 Let E = &e an irreducible source of pure states with von Neumann 

entropy S and suppose that a ^ S. Then E may be compressed to a qubits per signal plus 
auxiliary classical storage if and only if a > S. 

Thus no part of the quantum information content of £ may be represented in classical 
terms if £ is irreducible. However this is no longer true if the source is reducible as shown 
by the following example. 

Example Consider a reducible ensemble £ = aiSi U a2<f2 where {01,02} is a probability 
distribution and £{ are irreducible, supported in orthogonal subspaces Ei respectively. If pi 
is the density matrix of £i then the density matrix p of £ has a block diagonal form 



where -ff (01,02) is the Shannon entropy of {01,02}. 

We have the following encoding scheme for £: in a long string, measure each signal 
(without disturbance) to determine whether it is in Ei or E2. This provides H (01,02) 
bits/signal of classical information. For each value of i, Schumacher compress the signals 
lying in Ei to S{pi) qubits/signal. The total quantum resource on average^ is aiS{pi) + 
o.2S{p2) qubits/signal which is less than S{p) by the amount of the classical information 
extracted. Clearly the original string may be reconstituted with arbitrarily high fidelity (for 
suitably large block lengths) from the classical and quantum parts of the encoding. Thus 
if the original ensemble £ is reducible, it is always possible to convert part of its quantum 
information into classical information. In theorem \^ we will see that the above scheme 
is actually optimal for providing the minimal quantum resources in any classical-quantum 
compression scheme for a reducible ensemble. ■ 

Theorem 2 Let £ = U^i ^l^l ^^^2/ reducible ensemble with von Neumann entropy S 
where £1 are irreducible subensembles supported in orthogonal subspaces. Let Si be the von 
Neumann entropy of £1 and suppose that a 7^ ^lUiSi. Then £ may be compressed to a 
qubits per signal plus auxiliary classical storage if and only if 



P = Oipi © O2/O2 



so the Schumacher limit of £ is 



S{p) = H{ai,a2) + oi5(pi) + a2S{p2) 




^To make this statement precise we need to invoke properties of typical sequences as given in the proof 
of theorem |5| later. 
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Note that if we do not require the residual quantum resource of the encoding to be 
smaller than that of the original source then reversible extraction of classical information 
(even with perfect fidelity) is always possible. Indeed as described in ^ the process of 
quantum teleportation may be interpreted as a scheme for encoding a quantum source into 
classical and quantum information with perfect fidelity in decoding, but the associated 
quantum resource of the encoding is not less than that of the original source. Also one may 
consider the trivial encoding of retaining the input string untouched and merely attaching 
independent classical information, which is discarded in the decoding. 

We will approach the proofs of theorem ^ and theorem El through a series of lemmas. 
Firstly lemma [2 below will relate the quantum resource of any encoding-decoding scheme 
to the amount of mutual information per signal, between the classical part of the encoding 
and the identity of the input string. Then we will use lemmas El and to show that this 
mutual information must tend to zero for irreducible sources, as the fidelity of the scheme 
tends to unity, and we will also characterise its limiting value for reducible sources. 



5 Mutual information of the classical extraction 

Lemma 2 Let £ be a source with von Neumann entropy S. Let {En,Dn) be any encoding- 
decoding scheme for blocks of length n with average fidelity 1 — e. Suppose that the encoded 
states have a classical and quantum part as in eq. Let supp be the quantum resource 

of the encoding (as in eq. ilCH) ) and let I{L : J) denote the mutual information between 
the input string L and the classical data j i.e. the mutual information of the probability 
distribution p[ISzj) = Pi Cj . Then 

SUpp + ^^^>5-/(6) (11) 

n 

where /(e) is a function satisfying 

/(e) ^0 as e ^ 0. 

Remark We will actually prove a slightly stronger statement. Let {E, D) be an encoding- 
decoding scheme for a source £in in dimension d with von Neumann entropy Sin. Let Supp 
be the average number of qubits needed to support the intermediate ensembles £j and let 
/ denote the identity of the input state. Then 

S^ + I{I:J)>Sin-f{e)logd (12) 

In lemma Sin has the form (i.e. n-strings from £) so Sin = ^^-S*, Supp = nsupp and 
d = where k is the dimension of the single signal space. 

Proof Let us write the encoded states as 

ri = En \ai) = Y.p{j\L) ® u;]. (13) 

i 
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These states ah have a block diagonal form with blocks labelled by j containing p{j\I)u!j. 
Consider the ensemble Senc = {ti]Pi}- The average state is r = J2iPi'^i the Holevo 
quantity of Senc is 

Xenc = S(t) - ^p/5'(r/). 
I 

If we fix on any single value of j we have the ensemble £j = {ujj;p{I\j)}. Let Xj be the Holevo 
quantity of £j. Using the block diagonal form of S^nc a straightforward rearrangement of 
the formula for x gives 

Xenc = ^PiXi+2:(/: J). (14) 

j 

For any ensemble the Holevo quantity satisfies x ^ log d where d is the dimension of the 
space of states for the ensemble. Hence 

Xj < n suppj for all j. (15) 

and from eqs. (|15|) and (|14|) 

n SUPP+ I {I : J) > Xenc- (16) 

Now consider the decoding stage. We have 

Dniri) = a I 

with average fidelity 

F = Y,pjF{\ai),aj) = l-e. 
I 

Let Xdec be the Holevo quantity of the decoded ensemble Sdec = {c^/;^/} and let Xin = ""S* 
be the Holevo quantity of the input ensemble £ = {\cri) :,pi}. We will use the result, proved 
in appendix A, that high fidelity ensembles have close x's- More precisely, since £in and 
Sdec (supported in dimension d = k"' where k is the dimension of the single signal space) 
have fidelity 1 — e we can say 

\Xin - Xdecl < 4(V^log(A;") - V^log(2V^)) (17) 

so 

Xdec >nS- 4n^/~e log k + 4^6 log(2 V^) (18) 

Now the decoding operation Dn is a CPTP map and by the Uhlmann-Lindblad monotonicity 
theorem |1U1 lllj the Holevo quantity is non-increasing under any CPTP map. Thus Xenc ^ 
Xdec and eqs. (flT)|) and (|18|) give 

n supp + J(/ : J)>nS - Anyfe log k + 4A/e log(2^) 

so 

SUpp + ^^>5-/(6) 

n 

where /(e) ^ as e — > 0, as required. ■ 
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6 An information— disturbance relation 



To complete the proof of theorem ^ we wih argue that I [I : J) /n must also tend to zero as e 
tends to zero. The intuitive reason is the following. After encoding and decoding (thinking 
of e as being very small) the states aj reproduce the states |cr/) with high fidelity. But the 
classical data j can be assumed to remain after the process since it may be copied at the en- 
coding stage into another register which is not affected by the decoding operation. Now it is 
a general heuristic principle in quantum physics that one cannot obtain information about a 
fixed source of non-orthogonal states without disturbing them and furthermore there should 
be a tradeoff between the amount of disturbance and the amount of information gained. 
The fundamental role of information-disturbance tradeoffs in quantum measurement theory 
has been emphasised by C. A. Fuchs [H]. However we need a more refined version of this 
principle as our input source generally varies (because of increasing block lengths) as the 
disturbance e tends to zero. Nevertheless we will show that X(I : J)/n, the information 
gained per signal, goes to zero as the fidelity approaches 1. 

Indeed in the limiting case of perfect fidelity (i.e. where aj = \crj){aj\ ) it is not difficult 
to show that I {I : J) must be exactly zero (cf [^). The proof is as follows: Any CPTP 
map may be represented as a unitary operation acting on the input together with an ancilla 
(in some standard initial state |0)) followed by tracing over a subsystem of the output. Thus 
the encoding and decoding operation on \aj) may be represented as a unitary operation U 
on \(Ji)j^ \ ^)b registers A and B where B is the ancilla, yielding a pure state 

^(k/)A|0)B) = |A/)^B 

where a/ = tr ^ |A)(A| and the classical data is obtained from a subsystem of tr |A)(A|. If 
the |cj/)'s are reproduced with perfect fidelity we must have 

\^)ab = Wi)a\'I'i)b 
for some pure states But then from the unitarity of U 

(cT/|c7x)(0|0) = {ai\aK){Hi^K)- 

Hence if there is a chain from |c7/) to {(Tk) we must have (ipilipK) = 1 i-e. \tpi) = {iPk)- If 
£ is irreducible then this is true for all / and K so no measurement on register B can yield 
any information about the identity of /. In particular X(I : J) must be zero. 

In lemmas IHl and El below we will generalise the above argument to the scenario of 
arbitrarily high (but not perfect) fidelity, showing that 2{I : J)/n ^ as e ^ 0. 

Lemma 3 Suppose that £ = {\ai) ; pi} is an irreducible source with K states. Suppose that 
the states \ai) are provided in a register A with state space Bai and let register B he an 
ancilla with state space Ba2- We will refer to B as the environment. Let 

T ■.Ba,(S) Ba, Ba^ (19) 

^W^)A\0)B = \^'^AB (20) 
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be a unitary map such that 

Y,p,F{\a,),tT Bmm = l-e (21) 

i 

Let {pi = ti A ',Pi} be the environment ensemble and let 

X = s{J2pipi) - J2PiS{pi) 

i i 

be the Holevo quantity of the environment. Then if E is kept fixed but e, T and 02 ff^e 
allowed to vary, we have x < /(e) where the function f satisfies /(e) as e ^ 0. In fact 
we may take /(e) = A^/e + By/elog y/e where A and B are constants. 

Remark We are thinking here of T as being a unitary extension of a CPTP coding-decoding 
map D^En with high fidehty 1 — e. Note that any coding-decoding scheme for any source 
may be assumed to be of the form F in eq. (|19() where register A contains the final decoded 
state and the environment B may, without loss of generality, be assumed to retain a copy of 
the classical part of the encoding (since it may be copied after encoding and the copy kept 
intact during decoding). Thus tr b is the decoded version of the input \ai) and by 

Holevo's theorem jl5j x of the environment is an upper bound for the amount of information 
about i that may be obtained by any measurement on B. Thus the lemma states that any 
such information must approach zero as the average disturbance to the ensemble tends to 
zero. 

The proof of lemma 01 is given in appendix B. 

Note that in lemma 01 the source is kept constant as e varies: there is no notion of 
increasing block length as e — > 0. For our desired application in theorem ^ the ensemble 
£ varies as e — > since the block length generally increases. The proof of lemma |31 is not 
directly applicable in this situation (as the parameters K and C also increase with block 
length) and this extra complication is dealt with in lemma below. As a preliminary result 
we have: 

Lemma 4 (Markov Lemma) Let {Xi;pi} be any random variable with < < 1 and 
mean X > 1 — e. Then for any A we have Prob{Xi < 1 — Ae) < ^. In particular 

Prob{Xi < 1 - V^) < Re- 
proof If Prob{Xi < 1 — Ae) = a we get 

l-e<Y,PiXi= Pi^i+ PiXi < {I - Ae)a + {1 - a)l = 1 - Aae 

Xi<l-At Xi>l-A€ 

Hence Aa < 1. ■. 

Lemma 5 Suppose we have a sequence {cm > 0} with e^ ^ and let n{m) be any (gen- 
erally unbounded) function of m. Suppose also that for each m we have: 
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(i) a source 8^'^'> = Ei^"* 03 ... ^'^m) ^^ere each £^^^ is an irreducible source on a state 
space of at most k dimensions with at most K signal states. 

(a) An encoding- decoding scheme {E^'^\ D^'^''^) on S^™"^ with average fidelity 1 — em, leaving 
the environment in a state pf^^ for input state labelled by I = ii . . . in(m) ■ 

Then < 9{^m) where g is a function satisfying g{e) ^ as e — > 0. 

Hence the amount of information per position tends to zero as the fidelity tends to 1, for 
arbitrarily changing block lengths in the schemes. 

Remark In our application below of lemma IS] to an irreducible ensemble £, S^"^^ will be 
the ensemble of strings of length n(m) from £ so sj^"^^ = £ for all i and m. For reducible 
ensembles £ however, it will be necessary to consider the irreducible parts of the ensemble 
of n(m)-strings so that each £'f^^ will range over the various irreducible subensembles of £ 
and we will need lemma [S] in its full generality. 

Proof of lemma [S] Let us fix attention on any one of the schemes labelled by m and 
omit reference to the value of m in all labels, for notational clarity. We write I-^i for the 
input string with the l^^ position deleted. 

Let Ti-^...i„ be the decoded output for the input string |cri^...i„) = |crj^) . . . |crj„) G £±0. . . £n 
and write 

Fl = Fii...in = {o'ii...in \ '''h-.-in • 

Let Pi = Pi^...in = Pil^ ■ ■ -Pi^^ where pf^'' is the probability of |cJi) in the ensemble £k. For 
notational clarity we will henceforth omit the superscript {k) on the probabilities. Then 

^ = E = 1 - e- (22) 

il-.-in 

Let {pr,pi} be the ensemble of final environment states of the coding-decoding scheme. 
We will use the following inequality, proved in appendix C, for the Holevo quantity of the 
environment: 

x{{pi;pi}) 



< maxEw^feX/^fc, (23) 

where 



^I^k = SiY^PikPl) -Y.PikS{pl) 

ik ik 

and we will argue that each term on the RHS of eq. (|23|) tends to zero with e. 

Consider k = 1 (all others are similar). For each fixed choice of J^i = i2...in we 
extend \ai) in the Ii slot by \cri2...i„)i apply the operation DE, and look at the Ii slot of 
the output. This is a coding/decoding of |o"j) (i.e. length 1 string from £i) with output 
Tj = tr i2...i„Tii2...i„- Furthermore Xl^i is the Holevo quantity of the environment after this 
coding-decoding of £i . The fidelity is 

F^'^-'"^ = EiPii'^ilnWi) 

= J2iPi{(^i\^T^ 12: ■i„Tii2...i„\cri) (24) 

— l^iPi {'^ii2...in \ Tii2---in Wii2---in) ~ '^i PiFii2...i„ 
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(Here the last inequality arises since we can extend \o'i2...in) to an orthonormal basis of the 
I2 ■ ■ - In slots to perform the partial trace.) 

Next we apply the Markov lemma to the random variable 

{^12. ..in = = '^^PiFii^ i^ ; Pi2...in\ 

i 

(noting that eq. (|22|) gives X > 1 — e) to conclude: 

^PiFii2...in < 1 - \/e with probability < y/e. 

i 

Divide strings 12 ■ ■ - in (taken with probabilities Pi2...i„) into 

Sgood = {i2---in\Y^ PiFii2...i„ > 1 - ^/e} with total probability > 1 - ^ 

i 

Sbad = {k ■ ■ ■ in\'^PiFii2...i„ < 1 - \/e} with total probability < y/e 

i 

i.e. Sgood are those extensions of Ii which retain high fidelity for reproducing the first slot 
after coding/decoding of the extension. Now 

/^l good bad 

For good sequences, lemma |31 then gives xi^i ^ fiV^) (as fidelity of the coding/decoding is 
> 1 — ^/e) where / is a function satisfying /(x) ^ as x ^ 0. For bad sequences we always 
have Xijti ^ log k, where k is the dimension of the one-signal space. This is because for each 
value of 1^1 the ensemble {piii2...in'iPii}ii is obtained by a CPTP map from {|o"ji) ;Pii} so 
from the Uhlmann-Lindblad monotonicity theorem we get Xi^i ^ xi^i) ^ log/c. 
Hence from the weights of the good and bad sets we get 

Xi^, < (1 - Ve)f{Ve) + Velogk^ g{e) 
where clearly g((e) — > as e — > 0. ■. 



7 Completing the main proofs 

Finally we assemble our lemmas to provide proofs of theorems ^ and [21 

Proof of theorem^Suppose that £ can be compressed to a qubits/signal plus auxiliary 
classical storage. Then for each e > and all sufficiently large n there is an encoding- 
decoding scheme which, by lemma |21 satisfies 

a + ^^^>S-f{e). (25) 
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Here /(e) — > as e — > and by lemma (with £i = £ for all i) we have I{I : J)/n —> as 
e ^ too. But eq. holds for all e > so if a 7^ we must have a > S. 

Conversely if a > S" then S may be compressed to a qubits/signal using just standard 
Schumacher compression (and no auxiliary classical storage). ■ 

For the proof of theorem I^J we will use the following standard result: 

Lemma 6 (Lemma of typical sequences^ \1 6] / Let V = {pi, . . . ,pl} be any probabil- 
ity distribution and consider sequences ii . . . i„ of the symbols 1,...,L with probabilities 
Pii ■ ■ - Pin- n{i) be the number of times that the symbol i occurs in the sequence. For 
any e > and n let 5„(e) = {ii . . . i„ : \n{i) — npi\ < L^}. Then for any e > there is 
an 77-0 such that for all n > hq the total probability of ^^(e) is greater than 1 — e. For such 
sufficiently large n, Sn{e) is called a set of e -typical sequences. 

Thus a sequence is typical if the frequency of occurrence of each symbol i in it is 
approximately equal to the prior probability pi. 

Proof of theorem |2] We have a signal ensemble £ = U^i c^iSi where £1 are irreducible 
ensembles supported in orthogonal subspaces. Let A denote the probability distribution 
{ai, . . .,aL}. 

Suppose that £ can be compressed into a qubits per signal plus auxiliary classical 
storage. Then for each e > and all sufficiently large n there is an encoding-decoding 
scheme for n-strings with fidelity F > 1 — e and supp = a. 

The source of all n-strings from £ decomposes into irreducible parts: 

= U ai,... aiji, ^...^£i„ 

and the fidelity may be expressed as an average: 

F = ai^...ai^Fi^,„l^ > 1-e 

h...ln 

where Fi^^ i^ is the fidelity of the scheme when restricted to iSj^ . . . £i^. We will apply 
lemmas 121 and El to these irreducible parts. For any sequence h . . .In let n{l) denote the 
number of times that the symbol I = 1, . . . L occurs. Then the von Neumann entropy of 
£i^<^ ... ®£i„ is Y.in{l)Si. 

By applying the Markov lemma to the random variable {Fi^ i^] ai^ . . . a;^} we obtain a 
set of sequences 

Sgood = {h-.-ln'- -Pii-./n > 1 - Ve} with total probability > 1 - ^ 
By selecting the e-typical subset of these we get 

Sgood,typ = {h...ln- Fi^...i„ > 1 - \/e and h... In is e-typical} 
with total probability > 1 — 2^/e. 
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Consider the compression scheme acting on the irreducible component <S;^ (g) . . . (g) £i^^ . 
For any one of the good sequences lemma |21 gives 

na+X(4..,„ : J) > Y,n{l)Si - nf{^e) 
I 

where : J) is the mutual information for the source of restricted n-strings and 

f{x) ^ as X ^ 0. Furthermore lemma El gives Z{Ii^ : J)/n < g{\/e) where g{x) — > 
as a; ^ too. Hence 



If our chosen good sequence is also typical then 



L nil) L 
ai < <ai + 



so for each fixed e, — > a/ as n ^ oo. Thus 

a > ^ aiSi - /(^/e) - g{^/e) 
I 

and finally letting e ^ we get a > 0/5"/ as required. 

Conversely to see that the bound is tight let e > and 5 > be any chosen values and 
let a = aiSi + 6. For all sufficiently large n we encode an n-string from £'^'^ as follows. 
We first measure each signal (without disturbance) to determine which sub-ensemble £i it 
belongs to, giving a string li . . .1^ drawn from A^. If the sequence /i . . . is e-typical for A 
(so each value / occurs between ainziz L^/n/ y/e times) we perform Schumacher compression 
to naiSi + 0{^/n) qubits for each value of I, giving aiSi + 0(-^) qubits/signal overall. If 
the string is atypical we generate an arbitrary fixed state of na qubits. By the dominating 
weight of typical sequences and the asymptotic fidelity of Schumacher compression, this 
scheme clearly has fidelity 1 — 0(e) and for all sufficiently large n, the quantum resource 
will be less than a = ^20-1^1 + S qubits/signal. ■ 



8 Concluding remarks 

We have shown that no part of the quantum information of an irreducible source may 
be replaced by classical information if arbitrarily long strings are to be reconstitutable 
with asymptotically high fidelity 1 — e for all e > 0. Also for reducible sources we have 
characterised the maximum possible amount of classical information that can be reversibly 
extracted from the source under the above conditions. 

To obtain these results we first proved some information-disturbance relations. Let £ 
be an irreducible source. If some mutual information 2 is obtained about the identity of 
the state from a single emission from £ by any physical process, leaving the state intact 
with average fidelity 1 — e then we showed that X ^ as e ^ 0. For strings of length n 
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from £ we considered a sequence of encoding-decoding schemes (labelled by m = 1,2,...) 
with asymptotically perfect fidelity (1 — 1) for which the string length n{m) may 

vary arbitrarily (e.g. grow unboundedly) with m. In this case we showed that the mutual 
information per letter Im/n{m) provided by the m*'^ scheme, must tend to zero as the 
fidelity tends to 1. This was sufficient for our purposes but raises the interesting question 
of a possibly stronger result: does Im itself necessarily tend to zero too, as 0, or 

can Ira remain nonzero under these conditions (while 2m/n — > 0)? Consider for example 
an irreducible source £ = {\cro) , |fTi) ;po;Pi} of two non-orthogonal states. Is it possible 
to have a sequence of encoding-decoding schemes {En,Dn) for strings of increasing length 
n, such that the n^^ scheme has average fidelity 1 — e„ with e„ — > and the n^^ scheme 
provides X„ = 1 bit of information about the identity of the string? Here X„/n — > so our 
result is not contradicted yet a nonzero amount of information about the whole string is 
obtained with vanishing disturbance to the states. This question remains open. 
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Appendix A 

Proposition 2 Let £i = {iVk]Pk} and £2 = {pk]Pk} be two ensembles on the same space, of 
dimension d, with the same prior probabilities. Here ujk and pk are generally mixed states. 
Let xi a,nd X2 be their Holevo quantities. Suppose 

F = Y1 PkF{LOk, Pk) = l-€ 
k 

with e < 1/16. Then 

\X1 - X2I < 4(V^logd - ^log(2V^) (26) 

Proof In 7 it is shown that 

IX1-X2I <2(e'log(i-e'loge') (27) 

where ^ 

^Pillwj - pi\\ = e' <- 

i 

and \ \uj\\ denotes the trace norm (i.e. the sum of the absolute values of the eigenvalues) of 
oj. This norm is related to our fidelity function by 

\\p-oj\\< 2^1- F{p,oj). 
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Let efc = 1 - F{pk,uJk)- Then 

e' = ^PkW^k - PkW < 'i'^PkVe^ < 2y^Pfcefc = 2Ve- (28) 

Thus if e < we have e' < ^ and eqs. (|27|) and (|28|) give the required inequahty eq. (|26|) . 
■ 

We note that a shghtly weaker form of eq. H26|) is proved by different means in jS] . 

Appendix B 

Proof of lemma 13] Let r] be the smahest non-zero overlap Kcilcj)] of any two signals 
|c7j) and |cjj). We have 

r : Sq, (g) Sqi O 

mapping |cri)^ |0)^ to l^i)^^. Write 

o-i = tr B • 

The average fidelity is 

F = Y,p,F{\ai),a,) = l-e. 

i 

For each i let F{\ai) , ai) = 1 — €{ so that 

EPiei = e (29) 

i 

Since we have £ fixed but may think of e as varying, we have = 0(e) for each i. 

For each value of i we consider an orthonormal basis of register A which has \ai) as its 
first member: 

iWi) An) ,\t2) . .}. 

Since has fidelity 1 — to be \ai) in register A, we can write: 

\^i)AB = VI - \cri)A + V^ihi)AB (30) 

where the normalised state 17?,)^^ has the form 

It*) = ^ am \Tm) \Sm) (31) 
m>l 

and , , \52) , • • • are some normalised states of B (which generally all vary with i). 
For any other value A; of i we have correspondingly 

\^k)AB = VI - efc Wk)A I A) B + V^ hk)AB ■ (32) 

Our strategy is the following: thinking of and as small we note that the 17)^^ terms 
have small amplitude and we will now argue that the states and are then also close. 
Hence the reduced states in register B for different values of i will be almost independent 
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of i and hence will have very low x- Correspondingly any measurement on B can provide 
only very little information about the identity of i. For notational clarity we will sometimes 
write the product state \a) of registers AB as |a/3). 

The unitarity of T with the expressions eqs. (|3U() and H32|) gives 



Vl - eiV^k {cTiPil 7fe) + Vl - efc\/ei (7i| ^^kPk) + \/ei\/efc (7^l7fc)• 



(33) 
(34) 



Each inner product in the last three terms is a complex number with modulus at most one. 
Let us denote them by 01,02 and 03 and write fi = {(Ti\ak)- Then 



Now if e is sufficiently small we will have 



1 

<3 



and 



1 

<3 



(35) 



and recalling that |aj| < 1 we have 

(\/l - eiy^ai + VI - «2 + V^V^'^s) 



^ (yi - ei v^ + VI - gfc v^ + v^iVek) ^ ^ 

Also V^V^ — ^"2^^ ^iid /X > ?7 for all non-orthogonal |cjj) and Icja;). Putting all this 
together with eq. H35|) we see that for all non-orthogonal pairs \ai) and |(Tfc) we have 



2r] 



and note that C = 0(Ve)- Hence we can write 



\(3k) = (1 - C) I A) + \/2C - I Ai) 



(36) 



(37) 



where (/3i|/3,i) = and C = 0(Ve) so ^ |A) as e ^ 0. 
Recalling eq. (|3n|) and using eq. pT]) in eq. p2]) we have 

iCi) = Vl-eiki) Ift) + Ve^l7i) 

= Vr^k^)|ft)+0(V^) (38) 

iCfc) = vr^(i - c) Wk) m + v^V(2 - ovr^kfc) + v^i7fc> 

(39) 



Now write 
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Let us modify into |.^^) by replacing the basis {|crj) , |ti) , |t2) , . . .} in eqs. (|3Ujl and 

by the corresponding basis {|<Tfc) , \t[) , jrg) , . . .} used in the expression for {S^^)- Then 1^^) 

is a purification of 0,^ and |^|) is still a purification of Jlj. Thus for all ((Tj|o"fc) 7^ 0, we have 

Now £ with ii' states is irreducible so by lemma 7 there is a chain of length at most K from 
IfTjp) to every other \(Tj). Hence for every j, is near to any chosen fij,, in the following 
sense: 

F{Qi„n,)>l-KO{Ve). (40) 

We now compare the constant ensemble Sconst = {^i^'iPi} having xi^const) = 0, with the 
actual ensemble £ = {Qi;pi} of reduced states in register B arising from T, having xi^) = X- 
From eq. (|4flj) we have 

F{£,onst,£) = l-KO{V~e) 

and the result in appendix A gives |x~0| ^ f{^) where /(e) has the form A^/e + B^/elog^/e 
for suitable constants A and B, and /(e) — > when e — > 0, as required. ■ 

Appendix C 

We use the notation / to denote the index string / = ii . . . i„ and I^k to denote the 
string / with the k^^ position deleted. We aim to prove: 

Lemma 7 Let pi = pf"^ . . . he any product distribution of n probability distributions 
and let {pr,pi} be any associated ensemble of quantum states. Then 



x{{pi]Pl}) < nmax^p/^^ 



(41) 



We begin by defining the conditional and mutual entropies for a quantum state gabc on 
three systems vl, B and C: 

S{A\B) = S{A,B)-S{B) (42) 
S{A:B) = S{A) + S{B)-S{A,B) (43) 
S{A:B\C) = S{A\C) + S{B\C)-S{A,B\C). (44) 

Here S{A), S{A, B) etc. denote the von Neumann entropies of the states of the designated 
subsystems, obtained by partial trace from (Jabc- The following chain rules for conditional 
and mutual entropies are then simple consequences of the definitions: 

S{A^,A2,...,An\B) = Y.S{Ak\A<k,B) (45) 

k 

S{AuA2,...,An:B) = S{Ak : B\A<k). (46) 

k 
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(where A^k denotes the Ust Ai, . . . , Ak-i). Now suppose we are given a state crAiA2...AnB 
such that a'AiA2...A„ = ■ ■ ■ ^A^- We can calculate, for example, that 

S{A2:B\Ar) = S{A2\Ai) + S{B\Ai)-S{A2,B\Ai) (47) 
= SiA^) + S{A^,B)-S{A^)-S{AuA2,B) + SiAi) (48) 
= S{A2:Ai,B). (49) 

This relationship can then be used to prove an upper bound on the joint quantum mutual 
entropy as in the proposition below. This bound is a quantum analogue of a classical mutual 
information inequality given in jl7j . 

Proposition 3 For a state (yAxA2...A„B such that crAiA2...A„ = • • • '^crA„ the following 
inequality holds: 

S{Ai,A2,...,An:B)<nmaxS{Ak:A^k,B). (50) 

k 

Proof 



Y,S{Ak:B\A^k) 

k 


(51) 


J2S{Ak:A^k,B) 

k 


(52) 


Y,S{Ak : tr ykA^k,B) 

k 


(53) 


Y,S{Ak:A^k,B) 

k 


(54) 


nmax5(j4fc : A-^k,B). 

k 


(55) 



The next to last inequality follows from the strong subadditivity of von Neumann entropy, 
which implies that the quantum mutual entropy cannot increase under any CPTP map. ■ 

To obtain our desired inequality for X) we specialize to the case where A is a classical 
system correlated with B. Let 

aAiA2...AnB = ^Pi\I){I\a'^ Pb (56) 
I 

where I = iii2 Pi = 11^=1^1^^ ^^'^ \^) = ®fc=i Na;)^^ ^^^s of orthogonal states 

{\ik)A^.}- Then a straightforward calculation gives 

SiA,,A2,...,A^:B) = SiJ2prpi)-Y.piS{pi) (57) 

= xiipnpi}) (58) 



and applying the proposition gives 

x{{pi;Pl}) < nmaxJ^PL 

as required. 



(59) 
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